The WASABI Dataset: Cultural, Lyrics and Audio Analysis Metadata About 2 Million Popular Commercially Released Songs
نویسندگان
چکیده
Since 2017, the goal of two-million song WASABI database has been to build a knowledge graph linking collected metadata (artists, discography, producers, dates, etc.) with generated by analysis both songs’ lyrics (topics, places, emotions, structure, and audio signal (chords, sound, etc.). It relies on natural language processing machine learning methods for extraction, semantic Web frameworks forrepresentation integration. describes more than 2 millions commercial songs, 200K albums 77K artists. can be exploited music search engines, professionals (e.g. journalists, radio presenters, teachers) or scientists willing analyze popular published since 1950. is available under an open license, in multiple formats online source services including interactive navigator, REST API SPARQL endpoint.
منابع مشابه
All About Audio Metadata
Metadata, the “data about the audio data” that travels along with the multichannel audio bitstream in Dolby Digital, makes life easier for broadcasters while also increasing the creative ability of audio mixers. For broadcasters, audio metadata now means they have a set-and-forget solution, instead of monitoring, compressing, and adjusting levels all over the plant. For audio mixers, this means...
متن کاملGenre Classification of Spotify Songs using Lyrics, Audio Previews, and Album Artwork
This paper is an attempt to attack the problem of genre classification of music from a variety of angles. Three different types of data (song previews, album artwork, and lyrics) are used to train three models (a Recurrent Neural Network, k-Nearest Neighbors, and Naive Bayes, respectively) and the outputs of the three are again combined to classify a given song. The combined model was able to a...
متن کاملThe Million Song Dataset
We introduce the Million Song Dataset, a freely-available collection of audio features and metadata for a million contemporary popular music tracks. We describe its creation process, its content, and its possible uses. Attractive features of the Million Song Database include the range of existing resources to which it is linked, and the fact that it is the largest current research dataset in ou...
متن کاملA Preliminary Study on a Recommender System for the Million Songs Dataset Challenge
In this paper the preliminary study we are conducting on the Million Songs Dataset (MSD) challenge is described. The task of the competition is to suggest a set of songs to a user given half of its listening history and complete listening history of other 1 million people. We focus on memory-based collaborative filtering approaches since they are able to deal with large datasets in an efficient...
متن کاملIdentifying singers of popular songs
In this paper, we propose to identify the singers of popular songs using vibrato characteristics and high level musical knowledge of song structure. The proposed framework starts with a vocal detection process followed by a hypothesis test for the vocal/non-vocal verification. This method allows us to select vocal segments of high confidence for singer identification. From the selected vocal se...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Lecture Notes in Computer Science
سال: 2021
ISSN: ['1611-3349', '0302-9743']
DOI: https://doi.org/10.1007/978-3-030-77385-4_31